Identifying species network features from gene tree quartets under the coalescent model
نویسنده
چکیده
We show that many topological features of level-1 species networks are identifiable from the distribution of the gene tree quartets under the network multi-species coalescent model. In particular, every cycle of size at least 4 and every hybrid node in a cycle of size at least 5 is identifiable. This is a step toward justifying the inference of such networks which was recently implemented by Soĺıs-Lemus and Ané. We show additionally how to compute quartet concordance factors for a network in terms of simpler networks, and explore some circumstances in which cycles of size 3 and hybrid nodes in 4cycles can be detected.
منابع مشابه
Quartet Inference from SNP Data Under the Coalescent Model
MOTIVATION Increasing attention has been devoted to estimation of species-level phylogenetic relationships under the coalescent model. However, existing methods either use summary statistics (gene trees) to carry out estimation, ignoring an important source of variability in the estimates, or involve computationally intensive Bayesian Markov chain Monte Carlo algorithms that do not scale well t...
متن کاملIdentifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.
Gene trees are evolutionary trees representing the ancestry of genes sampled from multiple populations. Species trees represent populations of individuals-each with many genes-splitting into new populations or species. The coalescent process, which models ancestry of gene copies within populations, is often used to model the probability distribution of gene trees given a fixed species tree. Thi...
متن کاملIdentifiability and Reconstructibility of Species Phylogenies Under a Modified Coalescent
Coalescent models of evolution account for incomplete lineage sorting by specifying a species tree parameter which determines a distribution on gene trees. It has been shown that the unrooted topology of the species tree parameter of the multispecies coalescent is generically identifiable. Moreover, a statistically consistent reconstruction method called SVDQuartets has been developed to recove...
متن کاملAn algorithm for computing the gene tree probability under the multispecies coalescent and its application in the inference of population tree
MOTIVATION Gene tree represents the evolutionary history of gene lineages that originate from multiple related populations. Under the multispecies coalescent model, lineages may coalesce outside the species (population) boundary. Given a species tree (with branch lengths), the gene tree probability is the probability of observing a specific gene tree topology under the multispecies coalescent m...
متن کاملGene tree distributions under the coalescent process.
Under the coalescent model for population divergence, lineage sorting can cause considerable variability in gene trees generated from any given species tree. In this paper, we derive a method for computing the distribution of gene tree topologies given a bifurcating species tree for trees with an arbitrary number of taxa in the case that there is one gene sampled per species. Applications for g...
متن کامل